Search results for "exploratory data analysis"
showing 10 items of 16 documents
Tracking archaeological and historical mines using mineral prospectivity mapping.
2014
13 pages; International audience; The present study proposes a technological transfer from modern mining prospection to the field of archaeology, providing a methodology to facilitate the discovery of ancient mining sites. This method takes advantage of the thousands of geochemical analyses of streambed sediments, performed by national geological surveys to inventory mineral substances. In order to delineate geochemical anomalies, the datasets are treated following two different approaches: Exploratory Data Analysis and a fractal-based method often recognised as more powerful. Mineral prospectivity maps are then obtained by combining the results with a geographical information system. The s…
Development of Applications for Interactive and Reproducible Research: a Case Study
2016
For a proper understanding of the organization and regulation of gene expression, the computational analysis is an essential component of the scientific workflow, and this is particularly true in the fields of biostatistics and bioinformatics. Interactivity and reproducibility are two highly relevant features to consider when adopting or designing a tool, and often they can not be provided simultaneously.In this work, we address the issue of developing a framework that can provide interactive analysis, in order to allow experimentalists to fully exploit advanced software tools, as well as reproducibility as an internal validation of the analysis steps, by providing the underlying code and d…
Infrared biospectroscopy for a fast qualitative evaluation of sample preparation in metabolomics.
2014
Liquid chromatography-mass spectrometry (LC-MS) has been increasingly used in biomedicine to study the dynamic metabolomic responses of biological systems under different physiological or pathological conditions. To obtain an integrated snapshot of the system, metabolomic methods in biomedicine typically analyze biofluids (e.g. plasma) that require clean-up before being injected into LC-MS systems. However, high resolution LC-MS is costly in terms of resources required for sample and data analysis and care must be taken to prevent chemical (e.g. ion suppression) or statistical artifacts. Because of that, the effect of sample preparation on the metabolomic profile during metabolomic method d…
pcaExplorer: an R/Bioconductor package for interacting with RNA-seq principal components
2019
AbstractBackgroundPrincipal component analysis (PCA) is frequently useentirely written ind in genomics applications for quality assessment and exploratory analysis in high-dimensional data, such as RNA sequencing (RNA-seq) gene expression assays. Despite the availability of many software packages developed for this purpose, an interactive and comprehensive interface for performing these operations is lacking.ResultsWe developed the pcaExplorer software package to enhance commonly performed analysis steps with an interactive and user-friendly application, which provides state saving as well as the automated creation of reproducible reports. pcaExplorer is implemented in R using the Shiny fra…
MATLAB-based educational software for exploratory data analysis (EDA toolkit)
2009
This article presents an educational software developed in order to enable engineering students to gain insight into data sets via the exploratory data analysis (EDA). This software has been developed using the MATLAB GUIDE tool. This article shows the program suitability for learning EDA in different engineering courses related to data analysis such as data mining or data processing courses. © 2009 Wiley Periodicals, Inc. Comput Appl Eng Educ 20: 313–320, 2012
Empirical model evaluation and hypothesis testing
2016
Chapter 5 deals with the empirical model evaluation and the testing of hypotheses. It starts out with the evaluation of the measurement and the structural models, using the PLS algorithm. After the evaluation of the complete model, moderating effects are examined by conducting group comparisons (section 5.4.1) and by investigating interaction effects (5.4.2). After that, selected constructs are further examined by exploratory data analysis (section 5.5).
Basic Statistical Techniques
2012
Exploratory data analysis of environmental governance at local level in the south-west region of Poland
2018
Metabolic flux understanding of Pichia pastoris grown on heterogenous culture media
2014
[EN] Within the emergent field of Systems Biology, mathematical models obtained from physical chemical laws (the so-called first principles-based models) of microbial systems are employed to discern the principles that govern cellular behaviour and achieve a predictive understanding of cellular functions. The reliance on this biochemical knowledge has the drawback that some of the assumptions (specific kinetics of the reaction system, unknown dynamics and values of the model parameters) may not be valid for all the metabolic possible states of the network. In this uncertainty context, the combined use of fundamental knowledge and data measured in the fermentation that describe the behaviour…
Feature selection on a dataset of protein families: from exploratory data analysis to statistical variable importance
2016
Proteins are characterized by several typologies of features (structural, geometrical, energy). Most of these features are expected to be similar within a protein family. We are interested to detect which features can identify proteins that belong to a family, as well as to define the boundaries among families. Some features are redundant: they could generate noise in identifying which variables are essential as a fingerprint and, consequently, if they are related or not to a function of a protein family. We defined an original approach to analyze protein features for defining their relationships and peculiarities within protein families. A multistep approach has been mainly performed in R …